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CLAIMS 

1. (Currently Amended) A method of identifying a caller of a call from the caller to a 
recipient, the method comprising: 

(a) receiving a voice input from the caller; 

(b) applying characteristics of the voice input to a plurality of acoustic models, which 

comprises a generic acoustic model and acoustic models of any previously 
identified callers, to obtain a plurality of respective acoustic scores, wherein the 
generic acoustic model comprises caller independent models of a plurality of 
speech units; 

(c) identifying the caller as one of the previously identified callers or a new caller based 

on the plurality of acoustic scores; and 

(d) if the caller is identified as a new caller in step (c), generating a new acoustic model 

for the new caller from the caller-independent models of the generic acoustic 
model and modifying the caller-independent models of the speech units that are 
included in the voice input to represent the characteristics of the voice input 
received from the new caller such that the new acoustic model for the new caller 
and the acoustic models of any previously identified callers comprise the same 
plurality of speech units ; and 

(e) generating a user prompt, which identifies certain ones of the acoustic models of 

previously identified callers for merging together . 

2. (Original) The method of claim 1 wherein identifying in (c) comprises: 

(c)(1) identifying the caller as one of the previously identified callers if the acoustic score 
for the respective acoustic model is better than the acoustic score for the generic 
acoustic model; and 

(c)(2) identifying the caller as a new caller if the acoustic score for the generic acoustic 
model is better than the acoustic scores for the acoustic models of the plurality of 
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previously identified callers. 

3. (Previously Presented) The method of claim 1 wherein: 

step (a) comprises segmenting the voice input into a sequence of recognized speech units 

using the generic acoustic model; 
each of the plurality of acoustic models comprises models of the speech units segmented 

in step (a); and 

step (b) comprises applying the characteristics of the voice input to a sequence of the 
models of the speech units segmented in step (a) for the plurality of acoustic 
models. 

4. (Currently Amended) The method of claim 1 wherein the method further comprises: 

(e) (0 if the caller is identified as one of the previously identified callers in step (c), 
updating the respective acoustic model for the previously identified caller by 
modifying the models of the speech units that are included in the voice input, 
based on the characteristics of the voice input. 

5. (Currently Amended) The method of claim 4 wherein step (e) (f) comprises modifying 
the models of the speech units that are included in the voice input based on as little as a single 
utterance. 

6. (Currently Amended) The method of claim 1 and further comprising: 

(e) (f) storing the new acoustic model in an acoustic model repository with the plurality of 
acoustic models such that the new acoustic model becomes one of the plurality of 
acoustic models in step (b) and the new caller is included as a previously 
identified caller. 



7. (Canceled). 
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8. (Original) The method of claim 1 wherein steps (a) through (c) are performed without 
alerting the caller during the call that the caller is being identified. 

9. (Original) The method of claim 1 wherein: 

step (b) comprises splitting the voice input into subsections and applying the 
characteristics of each subsection to the plurality of acoustic models to obtain a 
plurality of respective acoustic scores that represent how well the characteristics 
in each subsection match the respective acoustic models; and 

step (c) comprises, for each subsection, identifying the acoustic model having the best 
acoustic score for that subsection and identifying the caller as one of the 
previously identified callers only if the best acoustic scores for all subsections 
correspond to the same previously identified caller. 

10. (Currently Amended) The method of claim 1 and further comprising: 

(e) (0 maintaining a caller-specific language model for each of the previously identified 

callers based on the voice inputs from those callers; 

(f) {g} applying the characteristics of the voice input to the generic acoustic model and 

each of the caller-specific language models to produce a plurality of recognized 
speech unit sequences; 

(g) (h) choosing the recognized speech unit sequence that has a highest probability 

relative to probabilities of the other recognized speech unit sequences; and 

(h) {i} identifying the caller based at least in part on the recognized speech unit sequence 

having the highest probability. 

11. (Currently Amended) The method of claim 10 and further comprising: 

(i) {j) if the caller identified in step (h) {i} is different than the caller identified in step (c), 

generating a user prompt of at least one of th e following member of the group 
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comprising : the voice input, the recognized speech unit sequence, the identified 
callers, the acoustic model of the caller identified in step (c), and of the caller- 
specific language model of the caller identified in step (h). 

12. (Currently Amended) The method of claim 1 and further comprising: 

(e) (f) using a distance measure between the plurality of acoustic models of the previously 
identified callers to flag the certain acoustic models for merging together. 



13. Canceled. 



14. (Currently Amended) A system for identifying a caller of a call from the caller to a 
recipient, the system comprising: 

a receiver for receiving a voice input from the caller; 

an acoustic model repository comprising a plurality of acoustic models, including a 
generic acoustic model and acoustic models of any previously identified callers, 
wherein the generic acoustic model comprises caller-independent models of a 
plurality of speech units; 

means for applying characteristics of the voice input to the plurality of acoustic models to 
produce a plurality of respective acoustic scores; 

means for identifying the caller as one of the previously identified callers or a new caller 
based on the plurality of acoustic scores; and 

acoustic model generator means for generating a new acoustic model for the new caller if 
the acoustic score for the generic acoustic model is better than the acoustic scores 
for the acoustic models of the plurality of previously identified callers, wherein 
the acoustic model generator means generates the new acoustic model from the 
caller-independent models of the generic acoustic model and modifies the caller- 
independent models of the speech units that are included in the voice input to 
represent the characteristics of the voice input received from the new caller such 
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that the new acoustic model for the new caller and the acoustic models of any 
previously identified callers comprise the same plurality of speech units ; and 
means for generating a user prompt, which identifies certain ones of the acoustic models 
of previously identified callers for merging together . 

15. (Original) The system of claim 14 and further wherein: 

the system further comprises a speech recognizer, which segments the voice input into a 

sequence of recognized speech units using the generic acoustic model; 
each of the plurality of acoustic models comprises models of the speech units recognized 

by the speech recognizer; and 
the means for applying comprises means for applying the characteristics of the voice 

input to a sequence of the models of the speech units segmented by the speech 

recognizer for the plurality of acoustic models. 

16. (Previously Presented) The system of claim 14 wherein: 

the system further comprises an acoustic model updating module, which if the caller is 
identified as one of the previously identified callers, updates the respective 
acoustic model for the previously identified caller by modifying the models of the 
speech units that are included in the voice input, based on the characteristics of 
the voice input. 

17. (Original) The system of claim 16 wherein the acoustic model updating module is capable of 
modifying the models of the speech units that are included in the voice input based on as little as 
a single utterance from the caller. 

18. (Original) The system of claim 14 wherein the acoustic model generator means stores the 
new acoustic model in the acoustic model repository such that the new acoustic model becomes 
one of the plurality of acoustic models and the new caller is included as a previously identified 
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caller. 

19. (Canceled). 

20. (Original) The system of claim 14 wherein system is configured to receive the voice input 
and identify the caller without alerting the caller during the call that the caller is being identified. 

21. (Original) The system of claim 14 wherein: 

the means for applying comprises means for splitting the voice input into subsections and 
applying the characteristics of each subsection to the plurality of acoustic models 
to obtain a plurality of respective acoustic scores that represent how well the 
characteristics in each subsection match the respective acoustic models; and 

the means for identifying comprises, for each subsection, means for identifying the 
acoustic model having the best acoustic score for that subsection and means for 
identifying the caller as one of the previously identified callers only if the best 
acoustic scores for all subsections correspond to the same previously identified 
caller. 

22. (Original) The system of claim 14 and further comprising: 

a language model repository for storing a caller-specific language model for each of the 
previously identified callers based on the voice inputs from those callers; 

means for applying the characteristics of the voice input to the generic acoustic model and 
each of the caller-specific language models to produce a plurality of recognized 
speech unit sequences; and 

means for choosing the recognized speech unit sequence that has a highest probability 
relative to probabilities of the other recognized speech unit sequences, wherein the 
means for identifying identifies the caller based at least in part on the recognized 
speech unit sequence having the highest probability. 
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23. (Currently Amended) The system of claim 22 wherein the means for identifying comprises 
means for generating a user prompt of at least one of the following member of the group 
comprising : (1) the voice input, the recognized speech unit sequence having the highest 
probability, (2) the caller-specific language model producing the recognized speech unit 
sequence having the highest probability, and (3) the acoustic model having the best acoustic 
score, if the caller-specific language model having the highest probability corresponds to a 
different caller than the acoustic model having the best acoustic score in (3). 

24. (Currently Amended) The system of claim 14 and further comprising: 

means for flagging the certain acoustic models for merging together based on a distance 
measure between the plurality of acoustic models. 

25. Canceled. 

26. (Currently Amended) A computer-readable medium comprising computer-executable 
instructions that, when executed by a computer, performs the method comprising: 

(a) receiving a voice input of a call from a caller; 

(b) applying characteristics of the voice input to a plurality of acoustic models, which 

comprises a generic acoustic model and acoustic models of aay previously 
identified callers, to obtain a plurality of respective acoustic scores that represent 
how well the characteristics match the respective acoustic models, wherein the 
generic acoustic model comprises caller-independent models of a plurality of 
speech units; and 

(c) identifying the caller as one of the previously identified callers or a new caller based 

on the plurality of acoustic scores^-and 

(d) if the caller is identified as a new caller in step (c), generating a new acoustic model 

for the new caller from the caller-independent models of the generic acoustic 
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model and modifies the caller-independent models of the speech units that are 
included in the voice input to represent the characteristics of the voice input 
received from the new caller such that the new acoustic model for the new caller 
and the acoustic models of any previously identified callers comprise the same 
plurality of speech units ; and 
(e) generating a user prompt, which identifies certain ones of the acoustic models of 
previously identified callers for merging together . 

27. (Original) The computer-readable medium of claim 26 wherein: 

step (a) comprises segmenting the voice input into a sequence of recognized speech units 

using the generic acoustic model; 
each of the plurality of acoustic models comprises models of the speech units segmented 

in step (a); and 

step (b) comprises applying the characteristics of the voice input to a sequence of the 
models of the speech units segmented in step (a) for the plurality of acoustic 
models. 

28. (Currently Amended) The computer-readable medium of claim 26 wherein the method 
further comprises: 

(e) (D if the caller is identified as one of the previously identified callers in step (c), 
updating the respective acoustic model for the previously identified caller by 
modifying the models of the speech units that are included in the voice input, 
based on the characteristics of the voice input. 

29. (Currently Amended) The computer-readable medium of claim 26 wherein the method 
further comprises: 

(a) £f) storing the new acoustic model in an acoustic model repository with the plurality of 
acoustic models such that the new acoustic model becomes one of the plurality of 
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acoustic models in step (b) and the new caller is included as a previously 
identified caller. 

30. (Canceled). 

31. (Currently Amended) The computer-readable medium of claim 26 wherein the method 
further comprises: 

(e) <QQ maintaining a caller-specific language model for each of the previously identified 

callers; and 

(f) (g) identifying the caller based at least in part on probabilities of recognized speech 

unit sequences produced by the caller-specific language models from the voice 
input. 

32. (Currently Amended) The computer-readable medium of claim 31 wherein the method 
further comprises: 

(g) (h) if the caller identified in step (f) (g) is different than the caller identified in step 

(c), generating a user prompt of at least one of th e following member of the group 
comprising : the voice input, the recognized speech unit sequence, the identified 
callers, the acoustic model of the caller identified in step (c), and of the caller- 
specific language model of the caller identified in step (f) £g). 

33. (Currently Amended) The computer-readable medium of claim 26 wherein the method 
further comprises: 

(e) (f) using a distance measure between the plurality of acoustic models of the previously 
identified callers to flag the certain acoustic models for merging together. 



34. Canceled. 
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35. (Currently Amended) A computer-implemented method of identifying a caller of a call 
from the caller to a recipient, the method comprising: 

(a) receiving a voice input; 

(b) segmenting the voice input into a sequence of recognized speech units using a caller- 

independent, generic acoustic model, wherein the generic acoustic model 
comprises caller-independent models of a plurality of speech units; 

(c) applying characteristics of the voice input to a sequence of speech unit models of the 

recognized speech units within a plurality of acoustic models, which comprises 
the generic acoustic model and acoustic models of any previously identified 
callerst-and 

(d) identifying the caller as one of a plurality of previously identified callers or as a new 

caller based on how well the characteristics of the voice input fit the plurality of 
acoustic models, and if the caller is identified as a new caller, generating a new 
acoustic model for the new caller from the generic acoustic model by modifying 
the speech unit models of the speech units that are included in the voice input to 
represent the characteristics of the voice input received from the new caller such 
that the new acoustic model for the new caller and the acoustic models of any 
previously identified callers comprise the same plurality of speech units; 

(e) maintaining a caller-specific language model for each of the previously identified 

callers based on the voice inputs from those callers; 

(f) applying the characteristics of the voice input to the generic acoustic model and each 

of the caller-specific language models to produce a plurality of recognized speech 
unit sequences; 

(g) choosing the recognized speech unit sequence that has a highest probability relative 

to probabilities of the other recognized speech unit sequences; 

(h) identifying the caller based on the recognized speech unit sequence having the 

highest probability; and 
(0 if the caller identified in step (h) is different than the caller identified in step (d\ 
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generating a user prompt of at least one member of the group comprising: the 
voice input, the recognized speech unit sequence, the identified callers, the 
acoustic model of the caller identified in step (d\ and of the caller-specific 
language model of the caller identified in step (h) . 

36. (Currently Amended) The method of claim 35 and further comprising: 

(e) (j) storing the new acoustic model in an acoustic model repository with the plurality of 
acoustic models such that the new acoustic model becomes one of the plurality of 
acoustic models in step (c) and the new caller is included as a previously 
identified caller. 

37. Canceled. 



38. (Currently Amended) The method of claim 35 wherein the method further comprises: 

(e) (j) using a distance measure between the plurality of acoustic models of the previously 
identified callers to flag certain acoustic models for merging together. 



